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Abstract: This paper considers gossiping among mobile agents in graphs: agents move on 
the graph and have to disseminate their initial information to every other agent. We focus 
on self-stabilizing solutions for the gossip problem, where agents may start from arbitrary 
locations in arbitrary states. Self-stabilization requires (some of the) participating agents to 
keep moving forever, hinting at maximizing the number of agents that could be allowed to 
stop moving eventually. 

This paper formalizes the self- stabilizing agent gossip problem, introduces the quiescence 
number (i.e., the maximum number of eventually stopping agents) of self-stabilizing solu- 
tions and investigates the quiescence number with respect to several assumptions related to 
agent anonymity, synchrony, link duplex capacity, and whiteboard capacity. 
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Quietude des commerages auto-stabilisants parmi des 
agents mobiles dans les graphes 

Resume : Dans cet article, nous considerons le probleme du commerage parmi des agents 
mobiles evoluant dans un graphe : les agents se deplacent dans le graphe et doivent disscmincr 
lcur information initiale a tous les autres agents. Nous nous restreignons aux solutions 
auto-stabilisantes pour ce probleme, ou les agents peuvent demarrer depuis des positions 
arbitraires dans des etats arbitraires. La propriete d'auto-stabilisation impose que certains 
des participants continuent de se deplacer indcfiniment, ct mcnc au probleme dc maximiscr 
le nombre d'agents autorises a s'arreter au bout d'un temps fini. 

Cet article formalise le probleme du commerage auto-stabilisant ct introduit l'indice de 
quietude (i.e., le nombre maximum d'agent autorises a s'arreter au bout d'un temps fini) 
des solutions auto-stabilisantes. Nous etudions revolution de l'indice de quietude suivant 
les hypotheses liees a l'anonymat des agents, la synchronic, la bidirectionalite des liens, et 
la presence de tableaux blancs. 

Mots-cles : Systemes distribues, Algorithme distribue, Auto-stabilisation, Agents mobiles 
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1 Introduction 

Distributed systems involving mobile entities called agents or robots recently attracted a 
widespread attention as they enable adaptive and flexible solutions to several problems. 
Intuitively, agents 1 are mobile entities operating in a network that is modeled by a graph; 
agents have limited computing capabilities and are able to move from a node to one of its 
neighbors. The gossip problem among mobile agents was introduced by Suzuki et al. [9, 10] 
as one of the most fundamental schemes supporting cooperation among mobile agents. The 
problem requires each agent to disseminate the information it is initially given to all other 
agents. Suzuki et al. [9, 10] investigated the problem of minimizing the number of agent 
moves for the gossip problem in fault-free networks, and presented asymptotically optimal 
distributed solutions on several network topologies. 

With the advent of large-scale networks that involve a total number of components in the 
order of the million, the fault (and attack) tolerance capabilities become at least as important 
as resource minimization. In this paper, we consider the gossip problem in networks where 
both agents and nodes can be hit by unpredictable faults or attacks. More precisely, we 
consider that arbitrary transient faults hit the system (both agents and nodes), and devise 
algorithmic solutions to recover from this catastrophic situation. The faults and attacks are 
transient in the sense that there exists a point from which they don't appear any more. In 
practice, it is sufficient that the faults and attacks are sporadic enough for the network to 
provide useful services most of the time. Our solutions are based on the paradigm of self- 
stabilization [4], an elegant approach to forward recovery from transient faults and attacks 
as well as initializing large-scale systems. Informally, a self-stabilizing system is able to 
recover from any transient fault in finite time, without restricting the nature or the span of 
those faults. 

Related works. Mobile (software) agents on graphs were studied in the context of self- 
stabilization e.g. in [1, 6, 7, 8], but the implicit model is completely different from ours. In 
the aforementioned works, agents are software entities that are exchanged through messages 
between processes (that are located in the nodes of the network), and thus can be destroyed, 
duplicated, and created at will. The studied problems include stabilizing a network by means 
of a single non-stabilizing agent in [1, 7], regulating the number of superfluous agents in [6], 
and ensuring regular traversals of k agents in [8] . 

In this paper, we follow the model previously used in [2], that studies necessary and 
sufficient condition for the problems of naming and electing agents in a network. The model 
assumes that the number of agents is fixed during any execution of the algorithm, but can 
start from any arbitrary location in the network and in any arbitrary initial state. Agents 
can communicate with other agents only if they are currently located on the same node, 
or make use of so-called whiteboards - public memory variables located at each node. Of 
course, whiteboards may initially hold arbitrary contents due to a transient fault or attack. 

1 Agents and robots can be used interchangeably in this paper. 
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Tabic 1 : Quiescence numbers of the fc-gossip problem 
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Our contribution. The contribution of this paper is twofold: 

1. We introduce the quiescence number of self-stabilizing agent-based solutions to quan- 
tify communication efficiency after convergence. Self-stabilizing agent-based solutions 
inherently require (some of the) participating agents to keep moving forever. This 
hints at maximizing the number of agents that could be allowed to stop moving after 
some point in every execution. The quiescence number denotes the maximum possible 
number of stopping agents 2 for a given particular problem. 

2. We study the quiescence number of self-stabilizing fc-gossiping (that denotes the gos- 
siping among k agents). The quiescence numbers we obtain are summarized in Table 
1, where "—1" represents impossibility of O-quiescence (that is, the problem is impos- 
sible to solve in a self-stabilizing way, even if agents are all allowed to move forever). 
We consider the quiescence number under various assumptions about synchrony (syn- 
chronous/asynchronous), node whiteboards (FW/CW/NW), edge capacity (half- 
duplcx/full-duplcx) and anonymity of the agents. The details of the assumptions are 
presented in the next section. 

Outline. In Section 2, we present the computing model with various assumptions we 
consider in this paper. We also introduce the gossip problem and define the quiescence 
number of the gossip problem. Section 3 provides impossibility/possibility results in the 
model where each agent has a unique id. Section 4 briefly considers the quiescence numbers 
in the model of anonymous agents. Concluding remarks are presented in Section 5. 

2 Preliminaries 

Model. The network is modeled as a connected graph G = (V,E), where V is a set of 
nodes, and E is a set of edges. We assume that nodes are anonymous, that is, no node 
has a unique id and all the nodes with the same degree are identical. We also assume that 
nodes have local distinct labels for incident links, however no assumption is made about the 
labels. Each node also maintains a so-called whiteboard that can store a fixed amount of 
information. 

2 Minimizing communication after convergence in ordinary self-stabilizing solutions has been largely in- 
vestigated with silent [5] protocols, and has been recently tackled, e.g. in [3] for message passing systems 
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Agents (or robots) arc entities that move between neighboring nodes in the network. 
Each agent is modeled by a deterministic state machine. An agent staying at a node may 
change its state, leave some information on the whiteboard of the node, and move to one of 
the node's neighbors based on the following information: (i) the current state of the agent, 
(ii) the current states of other agents located at the same node, (Hi) the local link labels 
of the current node (and possibly the label of the incoming link used by the agent to reach 
the node), and (iv) the contents of the whiteboard at the node. In other words, the only 
way for two agents to communicate is by being hosted by the same node or by using node 
whiteboards. 

In this paper, we consider several variants of the model, which fall into several categories: 

1. Agent anonymity: we consider two variants, named agents and anonymous agents. 
Each agent has a unique identifier in the named agent model, and all agents are 
anonymous and identical in the anonymous agent model. 

2. Synchrony: we consider two variants, synchronous model and asynchronous model. 
In the synchronous model, all the agents are synchronized by rounds in the lock-step 
fashion. Every agent executes its action at every round and can move to a neighboring 
node. When two (or more) agents are at the same node, all the agents execute their 
actions in one round but in sequence. In the asynchronous model, there is no bound on 
the number of moves that an agent can make between any two moves of another agent. 
However, we assume that each agent is eventually allowed to execute its action. When 
two (or more) agents are at the same node, they execute their actions sequentially. 
However, agents located at different nodes may execute their actions concurrently. 

3. Link duplex capacity: we consider two variants, full-duplex links and half-duplex 
links. A link is full-duplex if two agents located at neighboring nodes can exchange 
their position at the same time, crossing the same link in opposite directions without 
meeting each other. A link is half-duplex if only one direction can be used at a given 
time 3 . 

4. Whiteboard capacity: we consider three distinct hypothesis for information stored 
in the nodes' whiteboards. In the NW (No Whiteboard) model, no information can 
be stored in the whiteboard. In the CW (Control Whiteboard) model, only control 
information can be stored in the whiteboard. In the FW (Full Whiteboard) model, 
any information can be stored in the whiteboard (including gossip information, defined 
later in this section). 

Of course, there is a strict inclusion of the hypotheses, and a solution that requires only 
e.g. the NW or the CW classes will work with the less restricted classes (CW and FW, 
and FW, respectively). Conversely if an impossibility result is shown for less restricted 
classes e.g. CW and FW, it remains valid in the more restricted classes (NW, and NW 
and CW, respectively). 

3 If two agents at the different ends of a half-duplex link try to migrate along the link simultaneously, 
only one of them succeeds to migrate. 
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The first set of hypotheses (or the agent anonymity) divides between Sections 3 and 4. 
In each section, the remaining hypotheses (synchrony, link duplex capacity, and whiteboard 
capacity) are denoted by a tuple. For example, "(Synch, FW, half )-mode\ v denotes the 
synchronous model with FW whiteboards and half-duplex links. The wildcard "*" in the 
triplet denotes all possibilities for the category. For example, "(*, FW, half )-model" denotes 
both the (Synch, FW, hal /)-model and the (Asynch, FW, half ')-model. 

Gossip problem specification. We consider the gossip problem among agents: agents 
are given some initial information (called gossip information), and the goal of a protocol 
solving the problem is that each agent disseminates its gossip information to every other 
agent in the system. Each agent can transfer the gossip information to another agent by 
meeting it at a node or by leaving the gossip information in the whiteboard of a node. In 
the latter case, a FW whiteboard is required. The gossip information can be relayed by 
other agents, that is, any agent that has already obtained the gossip information of another 
agent can transfer all collected gossip information to other agents and can store it in the 
whiteboard of a node. 

In this paper, we consider self- stabilizing solutions for the gossip problem. The solutions 
guarantee that every agent eventually knows the gossip information of all the agents in the 
system even when the system is started from an arbitrary configuration. So, agents may 
start from arbitrary locations in arbitrary states, and the nodes' whiteboards may initially 
contain arbitrary information. We assume that k agents are present in the network at any 
time, yet k is unknown to the agents. In the sequel, the k-gossip problem denotes the 
problem of gossiping among k agents. 

Quiescence number. We introduce the quiescence of solutions for the fc-gossip problem 
to describe the fact that some agents, although executing local code, stop moving at some 
point of any execution. 

Definition 1 A distributed algorithm for mobile agents is ^-quiescent (for some integer I ) 
if any execution reaches a configuration after which I ( or more ) agents remain still forever. 

Definition 2 The quiescence number of a problem is the maximum integer I such that 
a l-quiescent algorithm exists for the problem. For convenience, the quiescence number is 
considered to be —1 if there exists no O-quiescent algorithm (i.e., the problem is not solvable). 

Suzuki et al. [9, 10] considered the CW and FW whiteboard models, and showed that 
the difference does not impact the move complexity of non- stabilizing solutions for the gossip 
problem. In this paper, we clarify some differences among the NW, the CW and the FW 
whiteboard models with respect to the quiescence number of self-stabilizing solutions for the 
gossip problem. 
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3 Self-stabilizing /c-gossiping among named agents 

Our first result observes that the self-stabilization property of a fc-gossiping protocol implies 
that at least one mobile agent must keep moving forever in the system. 

Theorem 1 There exists no k-quiescent self- stabilizing solution to the k-gossip problem in 
the (*, *, *)-model. 

Proof sketch. For the fc-gossip problem to be self-stabilizing, there must not exist any 
terminal configuration of agents, i.e. a configuration from which all the agents never move 
thereafter. Assume the contrary, i.e. there are k agents (with k > 2) in a n-sized network 
(with n > k) such that agent and nodes are in a configuration such that all agents do not 
move thereafter. Then we construct a 2n-sized network by mirroring the first network and 
joining the two by a node with no agent on it (there exists such a node since n > k). By 
mirroring the states and agents as well, there exists a second set of agents in the second 
part of the network that will never move thereafter. Since agents may only communicate by 
meeting other agents at the same node or by using whiteboards, the new k agents from the 
second group are never able to communicate with any agent from the initial fc-sized group. 
As agents do not have the knowledge of the actual number of agents in the system, none of 
them is able to distinguish between the two systems, hence the result. 

The above discussion is valid independently of the assumptions concerning the synchrony, 
the link duplex capacity, or the whiteboard capacity. □ 

Notice that Theorem 1 does not hold if agents know the number k of existing agents. 
With the assumption of known k, it could be possible for agents to stop moving when k 
agents are located at a same node, i.e., fc-quiescence may be attainable if the rendez-vous 
of k agents is possible. We now show that in the asynchronous case, no self-stabilizing 
algorithm can ensure that at least one agent does not move forever. 

Theorem 2 There exists no l-quiescent self- stabilizing solution, for any I (1 < I < k — 1), 
to the k-gossip problem in (Asynch, *, *) -model. 

Proof sketch. Assume there exists a l-quiescent self-stabilizing solution, that is, in any 
execution there exists an agent that does not move after a certain configuration. The agent 
is only aware of the states of agents at the same node and the contents of the whiteboard 
at the node, and this information is sufficient to make the agent quiescent. Now consider 
that the graph is regular (i.e. all nodes have the same degree) and non trivial. So, we can 
construct a configuration in which every agent is quiescent. Since we consider asynchronous 
systems, a quiescent agent cannot start moving unless another agent reaches the node. As 
a result, the agents never meet with each other and the gossiping cannot be achieved. □ 

While Theorem 2 precludes /-quiescence in asynchronous models for any I (1 < I < k— 1), 
the impossibility result does not hold for synchronous systems. Actually, in synchronous 
arbitrary networks, we present in Algorithms 3.1, 3.2, 3.3, and 3.4 a positive result: a 
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(k — l)-quiescent self-stabilizing solution to the fc-gossip problem with CW whiteboards. 
The algorithm is based on the observation that gossiping can easily be achieved when a 
single agent repeatedly traverses the network: the agent alternates indefinitely a traversal 
to collect information and a traversal to distribute information. In our scheme, each agent 
may move according to a dcpth-first-traversal (DFT) in the network, and eventually an 
agent with minimal identifier (among all agents) keeps traversing forever, while other agents 
eventually stop. Since the network is synchronous, a stopped agent at node u waits for the 
traversal of the minimal identifier agent a bounded period of time, then starts moving if no 
such agent visits u within the bound. 

Each node v has variables InLink v and OutLink v in its whiteboard to store information 
about the DFT of each agent i. We assume for simplicity that v locally labels each incident 
link with an integer a (0 < a < A v — 1) where A„ is degree of v, and v[a) denotes the 
neighbor of v connected by the link labeled a. Variables InLink v and OutLink v have the 
following properties: 

• A tuple (i, a) (0 < a < A v — 1) in variable InLink v of node v implies that agent i 
visited v first from v[a] (i.e, v[a] is the parent of v in the depth-first-tree). A tuple 
(i, _L) in InLink v implies that i did not visit v yet, or that i completed the DFT 
part starting from v (and returned to the parent of v in the depth-first-tree). For 
the starting node of the DFT, is always stored in InLink v . We assume that 
only a single tuple of each agent i can be stored in InLink v (this can be enforced 
having InLink v implemented through an associative memory) and we consider that 
the absence of any tuple involving i denotes that (i, _L) is actually present. 

• A tuple (i,a) (0 < a < A v — 1) in variable OutLink v of node v implies that agent i 
left v for v[a] but did not return from v[a] (i.e., i is in the DFT starting from v[a\). 
A tuple (i, _L) in OutLink v implies that i did not visit v yet, or that i completed the 
DFT part starting from v (and returned to the parent of v in the depth-first-tree) . We 
assume the same additional constraints as for InLink v . 

In a legitimate configuration, tuples related to agent i in InLink v and OutLink v of all 
nodes induce a path from the starting node to the currently visited node. However, in an 
arbitrary initial configuration, InLink v and OutLink v may contain arbitrary tuples for agent 
i (several incomplete paths, cycles, no starting node, etc.). We circumvent this problem by 
having each agent executing DFTs repeatedly. In order to distinguish the current DFT from 
the previous one, each agent i maintains a boolean flag tJriti that is flipped when a new 
DFT is initiated. Each node v also maintains a variable TJable v to store tJaiti from the 
last visit of agent i in the form of a tuple (i,bit). For simplicity, we consider that (i,true) 
is in T_table v if no tuple of i is contained in T_table v . 

We now describe the mechanism to stop the remaining k — 1 agents. We assume that 
each node v maintains variables MinID v , WaitT v , and Waiting v in its whiteboard. The 
minimum id among all agents having visited v is stored in MinID Vl and the (computed) 
time required to complete a DFT is stored in WaitT v . The completion time of a DFT is 
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Algorithm 3.1 Protocol (Part 1: constants, variables and timers) 

constants of agent i 

i: id of i; 
constants of node v 

deg v : degree of v; 
local variables of agent i 

tJriti: bool; 

//an alternating bit to distinguish the current traversal from the previous one 
local variables of node v 

TJ,able v : set of tuples (id, tJbit) ; 

/ / (id, tJbit) implies the latest visit of agent id was done with the alternating bit tJrit 
InLink v : set of tuples (id, port); 

II (id, port) implies agent id first came from v[port] in the current traversal 

// For each id, only the tuple updated last is stored 

// (id, _L) is stored if v is the initial node of the traversal 

// (id, _L) is considered to be stored if no (id, *) is present 
OutLink v : set of tuples (id, port); 

II (id, port) implies agent id went out from v to v[port] last time it visited v 

II For each id, only the tuple updated last is stored 
MinID v : agent id; 

// the minimum id of the agents that have visited v 
WaitT v : int; 

// The amount of time agents with the non-minimum id should wait 
Waiting v : set of agents; 

// The set of agents waiting for timeout at v 
timers of node v 

Timer v : count -up timer; 

The timer value is automatically increased by one at every round 
functions on the local timer of node v 
reset(Timer v ) : Reset the timer value to 
read(Timer v ) : Return the timer value 



measured by the count-up timer Timer v of v as follows. Agent p with the minimum id 
repeatedly makes DFTs. When visiting v for the first time at each DFT, p sets the count-up 
timer of v to WaitT v and resets the timer. Eventually, p completes each DFT in 2m rounds, 
where m is the number of edges in the network, and WaitT v = 2m remains true thereafter. 
When visiting v, an agent p' finds a smaller id in MinID v and stays at v until the timer 
value of v reaches WaitT v . Since p eventually completes each DFT in 2m rounds, each agent 
other than p eventually remains at a node v (Vs timer is reset regularly enough to never 
expire). 
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Algorithm 3.2 Protocol (Part 2: Main behavior) 

Behavior of node v at each round 
for each arriving agent i do 

visit v (i); 
timeout ..check _and_execute v ; 



Lemma 1 Starting from any initial configuration, in every execution of Algorithms 3.1, 
3.2, 3.3, and 3.4, eventually the agent with the minimum identifier repeatedly depth-first- 
traverses the network. 

Proof sketch. Let p be the agent with the minimum id (among all the agents in the system). 
When p visits node v, if p < MinID v then MinID v = p is executed. Otherwise (i.e. 
when MinID v stores an identifier that is not the id of any existing agent), p suspends 
its DFT and waits for timeout at v (p is appended into Waiting v ). Since no agent with 
the fake id exists in the network, read(Timer v ) > WaitT v eventually holds (in function 
timeout -check Jind-execute v ) . When this is the case, MinID v — min{j | j G Waiting v }(= 
p) is executed and p resumes the suspended DFT. Once MinID v is changed to p, MinID v 
never stores an id smaller than p again. 

Now consider a DFT initiated by agent p with tJ>it p = b (b G {true, false}). In a 
legitimate configuration, p initiates a DFT from a node v satisfying (p, _L) G InLink v . 
However, in the initial configuration, (p, a) G InLink v may hold for some a (0 < a < A v — 1) 
where v is the node p is initially located at. We first show that p eventually terminates the 
DFT starting from such an initial configuration and initiates a new DFT with tJbit p = ->b. 
When p with tJbit p — b visits a node u in a forward move, p changes its tuple in T_table u to 
(p, b) if (p, b) TJable u . Otherwise, p backtracks. Since (p, b) in TJable u never changes to 
(p, ->b) as long as p continues the DFT with tJbitp = b, p can make at most m forward moves 
in the DFT. On the other hand, agent p backtracks from u to u[a] only when (p, a) G InLink u 
holds. When backtracking from u to u[a], p changes its tuple in InLink u to (p, _L). Thus, p 
can make at most n backtracking moves in the DFT. Consequently, p eventually terminates 
the DFT even when it starts the DFT from a node v with (p, _L) ^ InLink v . 

Now consider a DFT initiated by agent i with tjbiti = b at node v with (i, _L) G InLink v . 
Let G — (V',E') be a connected component containing v of G" b = (V~' b ,E~' b ) where 
V^ b = {u G V | (i, ~^b) G T_table u when i initiates the DFT} and E~~ b = (V~' b x V b ) n E. 
Since the algorithm can be viewed as a distributed version of a sequential DFT, it means 
i makes a DFT in G' and its outgoing edges (if they exist). When the DFT completes, 
the tuple of i stored in TJable u changes to (i,b) at each u in V, while the tuple of i 
stored in TJable w remains unchanged at w (^ V) during the DFT. Thus, if G is not the 
whole network, the connected component G" (similarly defined as G for the next DFT with 
t_biti — -i&) contains at least one more node than G . Since the network is finite, eventually 
i makes DFTs repeatedly over the whole network. □ 
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Algorithm 3.3 Protocol (Part 3: Behavior when agent i arrives at node v from v[a\) 
function visit v (i); 

1 1 Executed when agent i visits node v from v[a] (a can be any label initially 
if ((ijtJriti) T_table v ) { // first visit of i at v in the current traversal 
add (i,tJriti) to T_table v ; 
add (i, a) to InLink v ; 
if (i < MinlDy) { 
MinID v = i; 
WaitT v = read(Timer v ); 

reset(Timer v ); // Timer is reset to start measuring the traversal time 
if (deg v > 2) { 

add (i,next v (a)) to OutLink v ; // next v (a) = (a + 1) mod deg v 
migrate to v[next v {a)]; 

} 

else { // deg v = 1 then backtrack to v[a] 
add (i, _L) to InLink v ; 
migrate to v[a\; 

} 

} 

else 1 1 i> MinID v 

add i to Waiting^ ; 

} 

else if ((i,o) OutLink v ) 

II i previously visited v in the current traversal, i backtracks to v[a\. 
migrate to v[a\; 
1 1 The followings are the cases when i backtracks to v from v[a\. 
else if ((next v (a) == 0) and ((i, _L) € InLink v )) { 

II v is the initial node of i's traversal and i completes the current traversal 
if (i < MinID v ) { 

MinID v = i; WaitT v — read(Timer v ); reset(Timer v ); 

1 1 Initiate a new traversal 

tJbiti = —itjnti; 

add (i,tJbiti) to TJ,able v ; 

add (i, 0) to OutLink v ; 

migrate to v[0]; 

} 

else // i > MinID v 

add i to Waiting v ; 

} 

else if ((i, next v (a)) £ InLink v ) { 

II v is not the initial node of i's traversal, i completes the current traversal from v 

add (i, _L) to InLink v \ 

add (i, _L) to OutLink v ; 

migrate to u[nea;t l ,(a)]; // i backtracks 

} 

else { // i did not complete the current traverse from v 
add (i,next v (a)) to OutLink v ; 
migrate to v[next v (a)]; 
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Algorithm 3.4 Protocol (Part 4: Behavior when Timeout occurs) 

function timeout -check jmd-.execute v ; 

if read(Timer v ) > WaitT v ; { // Timeout occurs 
MinID v = min{j | j £ Waiting v }; 
Let i be such that MinID v = i; 
Waiting v = Waiting v — {i}; 

reset(Timer v ); / / Timer is reset to start measuring the traversal time 
if ((i, a) G InLinkv for some a (0 < a < deg v — 1)) { 
// v is not the initial node of i's traversal 
Let a be such that (i, a) G InLink v ; 
if (deg v > 2) { 

add (i,next v (a)) to OutLink v ; 
i migrates to v[next v (a)]; 

} 

else { // deg v = 1 then backtrack to v[a] 
add (i, T) to InLink v ; 
i migrates to v[a]; 

} 

else { // v is the initial node of i's traversal 
/ / Initiate a new traversal 
tJbiti = -itJbiU; 
add (i,tJbiti) to TJ,able v ; 
add (i, 0) to OutLinkv ; 
migrate to u[0]; 

} 

} 
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Theorem 3 The protocol defined by Algorithms 3.1, 3.2, 3.3, and 3.4 is a (k— l)-quiescent 
self- stabilizing solution to the k-gossip problem in arbitrary networks in the {Synch, CW, *)- 
model. 

Proof sketch. Let p be the agent with the minimum id. From Lemma 1, eventually p makes 
DFTs repeatedly over the whole network. Once p completes the DFT, MinID v never 
becomes smaller than p at any node v. 

Now consider p's DFT of the whole network that is initiated at a configuration satisfying 
MinID v > p at every node v. Then, p repeatedly performs DFTs without waiting at any 
node, and p completes each DFT in 2m rounds. This implies that timeout never occurs at 
any node starting from the second DFT. Any agent q other than p suspends its DFT when 
visiting any node u. Agent q can return to its suspended traverse only when timeout occurs 
at u. However, since timeout never occurs at u, q never returns to its suspended traverse 
and remains at u forever. □ 

To complete our results for the synchronous case, let us observe that in the (Synch, CW, *) 
and (Synch, FW, *) models, the quiescence number of the fe-gossip problem among named 
agents is A; — 1 (by Theorems 1 and 3). There remains the case of NW whiteboards, un- 
fortunately the following theorem show that when the memory of agents is bounded (the 
bound may depend on the network size n), the fc-gossip problem among named agents is 
not solvable. 

Theorem 4 The quiescence number of the (*, NW, *)-model is — 1 for the k-gossip problem 
among named agents, when state space of each agent is bounded (but may depend on the 
network size n). 

Proof sketch. We prove the impossibility for synchronous ring networks. We assume for the 
purpose of contradiction that a O-quiescent solution exists, and that each agent has at most 
s states. Notice that s is not necessarily a constant and may depend on the network size. 

We consider system executions where each agent starts its execution from a predeter- 
mined state. Since no information can be stored in the whiteboards (model NW), the 
behavior of an agent depends solely on its own state and id (the network being regular). 
When an agent executes an action, it changes its state then (potentially) moves (clockwise 
or counterclockwise). Since each agent has at most s states, it repeats a cyclic execution of 
at most length s unless the agent meets another agent. Since only three kinds of moves are 
possible, there exists at most 3 S+1 moving patterns in the cyclic behavior of length s or less. 
Now we consider a sufficiently large domain of agent identifiers (e.g. k x 3 S+1 ). All possible 
agents are partitioned into at most 3 S+1 groups depending on their moving patterns, and 
thus, some group contains k or more agents. Now consider k agents in the group of size k or 
more, that are placed regularly in different nodes in the initial configuration of the nodes. 
Since agents in the group makes the same moving pattern in the cycle, the agents repeat 
the cyclic action without meeting each other in the synchronous execution. In the models 
with the whiteboards NW, the gossiping cannot be achieved without meetings of agents, 
which is a contradiction. □ 
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Note that the impossibility result holds even though the agents all start from a well 
known predefined initial state. Thus, if the initial location of agents is not controlled, even 
non-stabilizing solution are impossible to design. For asynchronous models, the remaining 
question is about the possibility of O-quiescence. 

Theorem 5 The quiescence number of (Asynch, CW, full) and (Asynch, NW, full) model 
is —1 for the k-gossip problem among named agents. 

Proof sketch. We show that there exists no O-quiescent self-stabilizing solution to the fc-gossip 
problem in (Asynch, CW, full)-model. Let us assume for the purpose of contradiction that 
there exists a O-quiescent self-stabilizing solution. All k agents must keep moving in the 
O-quiescent solution since 1-quiescence is impossible from Theorem 2. 

Now consider a particular agent p. In the asynchronous system with full-duplex links, 
there exists an execution such that p never meets any other agent: before p reaches a node 
u, all the agents staying at u leave u. Notice that full-duplex links allow the agents to leave 
u without meeting p: scheduling allows to have all agents exiting u by the same link used 
by p to arrive at u to be moving concurrently with p. It follows that in the execution, agent 
p cannot disseminate its own information (agents have to meet one another in C W model) . 
Hence the result. □ 

Theorem 6 The quiescence number of (Asynch, FW, *)-model is for the k-gossip problem 
among named agents. 

Proof sketch. Theorems 2 shows that 1-quiescence is impossible. Thus, it is sufficient to 
present a O-quiescent self-stabilizing solution in (Asynch, FW, *)-model. 

Consider the following protocol outline. Every agent repeatedly performs DFTs of the 
network. When an agent visits a node, it stores its gossip information in the whiteboard and 
collects the gossip information stored in the whiteboard. After a DFT has been completed 
by every agent, all whiteboards contain the gossip information of all the agents, and every 
agent can obtain all the gossip information by performing an additional DFT. 

The self-stabilizing DFT can be realized in the same way as the protocol presented in 
Theorem 3: each agent simply behaves as the agent with the minimum id of the protocol, 
yet does not need to wait at any node. □ 

Theorem 7 The quiescence number of (Asynch, CW, half ) -model is for the k-gossip 
problem among named agents. 

Proof sketch. Theorems 2 shows that 1-quiescence is impossible to attain. Thus, it is 
sufficient to present a O-quiescent self-stabilizing solution in (Asynch, CW, half )-model. 

Consider the following protocol outline. Every agent repeatedly performs DFTs of the 
network while recording at every traversed node the last targeted neighboring node. By 
the recorded information, other agents can trace a particular agent. When an agent visits 
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a node and finds a smaller id than its own, it starts tracing the agent with the smaller id. 
Eventually all the agent other than agent p (that has minimal id) continue tracing p, that 
in turns perform a DFT forever. Since we assume the half-duplex edges, agents cannot miss 
one another on a link, and agents perform the same DFT and the agent with minimal id. 
Then a similar argument as in [2] implies that all agents other than p meet p infinitely often. 
Thus, by means of agent p, every agent can disseminate its gossip information to all other 
agents. 

The self-stabilizing DFT can be realized in the same way as the protocol presented in 
Theorem 3. The only difference is in the way to detect the fake ids. In the protocol of 
Theorem 3, fake ids are detected by a timeout mechanism. Here, each agent records at each 
node the distance from the starting node in the depth-first-tree. In any trace labeled with a 
fake id, the tracing agent eventually detects contradiction in the distances and then decides 
that the traced id is a fake one. Agent p detecting a fake id erases the false records on the 
path p traced. □ 



4 Self-stabilizing /c-gossip among anonymous agents 

Named agents being a stronger assumption than anonymous agents, all the impossibility 
results for named agents also hold for anonymous agents. In this section, we consider only 
the model variations that the impossibility results for named agents do not cover. 

Theorem 8 The quiescence number of (*, CW,*)-model is —1 for the k-gossip problem 
among anonymous agents. 

Proof sketch. Consider a synchronous ring network where all the whiteboards of nodes 
contain the same initial information. Assume that all the agents are in the same state in 
the initial configuration. In the synchronous system, all the agents move exactly the same 
and they never meet each other, and thus, the gossiping cannot be completed. □ 

Theorem 9 The quiescence number of (Asynch, FW, *)-model is for the k-gossip problem 
among anonymous agents. 

Proof sketch. From Theorem 2, it is sufficient to present a O-quiescent self-stabilizing solution 
to the gossip problem in (Asynch, FW, *)-model. 

Since the whiteboards FW is available, the fc-gossiping can be completed if every agent 
repeatedly traverses the network. However, an anonymous agent cannot record at a node 
that it has visited the node since its record cannot be distinguished from that of others. 
Thus, anonymous agents cannot execute the DFT like the ones in Theorem 3. Instead, 
each agent can traverse all the paths of a given length, say I, using the link labels (i.e., 
traverse all the paths in the lexicographic order of the label sequences) . When completing 
the traverse of the paths of length I, the agent starts traversing the paths of length I + 1. 
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By repeating the traverses with incrementing the length, eventually the agent can traverse 
the whole network. □ 

For the synchronous anonymous agents, Theorem 9 guarantees that the quiescence num- 
ber is at least 0. On the other hand, the impossibility of /c-quiescence for synchronous named 
agents (Theorem 1) leads to the following theorem. 

Theorem 10 The quiescence number of (Synch, FW, *)-model is not larger than k — l and 
not smaller than for the k-gossip problem among anonymous agents. □ 

5 Conclusion 

This paper introduced the notion of quiescence for mobile agent protocols in a self-stabilizing 
setting. This notion complements the notion of silence [5] used in "classical" self-stabilizing 
protocols. While /c-quiescence of fc-gossiping among named agents is easily attainable in 
non- stabilizing solutions (assuming FW and CW whiteboards) [9, 10], this paper shows 
that self- stabilization prevents fc-quiescent solutions in any considered model, and even 0- 
quiescent solutions in some particular models. Thus, our paper shed new light on the 
inherent difference between non-stabilizing and self-stabilizing solutions of agent-based sys- 
tems. 

We would like to point out interesting open questions: 

1. What is the exact quiescence number of the (Synch, FW, *)-model for the £:-gossip 
problem among anonymous agents? (besides being not smaller than and not larger 
than k—l) 

2. What is the connection between the quiescence number and the topology ? 

3. Does there exist a non-trivial non- stabilizing problem with quiescence number lower 
than k ? 
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